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Given n independent random marked d-vectors (points) Xi distributed with a common density, 
define the measure v n = ■ where £j is a measure (not necessarily a point measure) which 
stabilizes; this means that £j is determined by the (suitably rescaled) set of points near Xi. For 
bounded test functions / on R d , we give weak and strong laws of large numbers for u„(f). The 
general results are applied to demonstrate that an unknown set A in d-space can be consistently 
estimated, given data on which of the points Xi lie in A, by the corresponding union of Voronoi 
cells, answering a question raised by Khmaladze and Toronjadze. Further applications are given 
concerning the Gamma statistic for estimating the variance in nonparametric regression. 
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1. Introduction 

Many interesting random variables in stochastic geometry arise as sums of contributions 
from each point of a point process X n comprising n independent random d- vectors Xi , 1 < 
i < n, distributed with common density function. General limit theorems, including laws 
of large numbers (LLNs), central limit theorems and large deviation principles, have been 
obtained for such variables, based on a notion of stabilization (local dependence) of the 
contributions; see [16, 17, 18, 20]. In particular, Penrose and Yukich [18] derive a general 
weak LLN of the form 

n 

^n-^n^i . . . , n^X„}) -^0, (1.1) 

i=l 

where £(x; X) is a translation-invariant, real-valued functional defined for all finite X C 
R d and x & X , with £ satisfying stabilization and (1 + e)th moment conditions. The result 
of [18] also provides information about the limiting constant (3. 

Numerous applications of (1.1) are given in [18], for example, to sums of the form 
Yl e eG(x n ) 4>( nl ^ d \ e \), where <f> is a test function satisfying a growth bound, G(X n ) is (for 
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example) the nearest neighbour or Delaunay graph on X n and |e| denotes the length 
of edge e. For such sums, Jimenez and Yukich [9] give a strong LLN. Other examples 
considered in [2, 18] and elsewhere include those concerned with Voronoi graphs, minimal 
spanning trees, germ-grain models and on-line packing. In the present paper, we shall 
give general results extending the basic limit theorem (1.1) in the following directions. 

1. Almost sure convergence. Under a moments condition and growth bound on the 
increment in the left-hand side of (1.1) on addition of a further point (the so-called 'add 
one cost'), this result can be extended to a strong LLN. 

2. Convergence of measures. Consider the random measure on K d comprising a point 
mass at each point Xi of X n , of magnitude given by the ith term in the sum in the left- 
hand side of (1.1). This measure keeps track of the spatial locations of the contributions 
to the sum. Its asymptotic behaviour has been considered recently in [2, 7, 14, 20]. In fact, 
it is not necessary to restrict oneself to a point mass at Xi and one can generalize further 
by considering the case where the contribution of the ith term to the random measure 
is some measure determined by (Xi,X n ) and localized 'near' to Xi in some sense; for 
example, the 1-dimensional Hausdorff measure on the graph G(X n ) takes this form. We 
provide LLNs for the integrals of appropriate test functions against such measures. 

3. Non-translation-invariance. It turns out that the translation-invariance condition 
on £ (which will be defined formally later) can also be relaxed in the limiting result (1.1) 
if, instead of scaling the point process X globally, as in (1.1), one scales the point process 
X n locally at each point Xi to keep the average density of points bounded, as n becomes 
large. 

4- Marked point processes. In a number of examples in the literature, the points of the 
point process X n are required to carry independent identically distributed marks in some 
abstract measure space (the mark space). 

We state our general results in Section 2 and prove them in Section 4, using auxiliary 
results on weak convergence given in Section 3. In Section 5 (which can be read without 
consulting Sections 3-4), we illustrate our general results with two specific applications in 
nonparametric statistics. Many other fields of application have been discussed elsewhere 
[2, 16, 18] and we do not attempt to review all of these, but our results often enable us 
to generalize results in those papers as indicated above. 

The first application in Section 5 is to a question raised by Khmaladze and Toronjadzc 
[11], motivated by statistical image analysis. Suppose A is an unknown subset of the unit 
cube in M. d and that the random vectors Xi represent 'sensors' which are scattered at 
random over the unit cube. Each sensor can detect whether it lies in the set A or not. 
A reasonable estimate A n of the unknown set A, given the binary data from n sensor 
locations X n , is then given by the union of the Voronoi cells, relative to the random 
point set X n , of those points Xi which lie in A. Essentially, the question raised in [11] is 
whether or not A n is a consistent estimator for A as n — > oo ; we answer this affirmatively 
via the strong LLN for measures. 

Our second application is to a nonparametric regression model. Suppose that with 
each point Xi S R d , we have an associated real- valued measurement Yi, related to Xi by 
Yi = h(Xi) + ei. Here, we assume that the function h G C^R^R) is unknown and that 
the independent error has mean zero and unknown variance a 2 . For a large observed 
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sample (AQ, Yi)™ =1 , a possible estimate for a 2 is given by Wi := (Xj(i,n,i) ~ ^i)/2j where 
j(i,n, 1) is chosen so that Xjn^^ is the nearest neighbour of AQ in A"„. This estimator 
is approximately unbiased for n large since then W, « (e^^i) — ei) 2 /2. 

The mean VF of IFi , . . . , W n is the so-called Gamma statistic [5] based on nearest 
neighbours; one can consider a similar statistic based on fcth nearest neighbours. We 
shall use our general results (requiring non-translation-invariance and marked points) to 
derive large-sample asymptotic properties associated with Gamma statistics. 

2. Notation and general results 

Let (M.,J-Mi fJ-M) DC a probability space (the mark space). Let £(x;A", .A) be a Borcl 
measurable R- valued function defined for all triples (x, X, A), where X C R d x M. is 
finite and where x = (x,t) G X (so x G R d and t & M.) and A is a Borel set in R d . We 
assume that £(x; A 7 ) := £(x; X, •) is a a-finite measure on R d . 

Suppose x=(i,i)eK [! xM and X C R d x M is finite. If x £ AT, we abbreviate and 
write f (x; A") instead of £(x; A" U {x}). We also write A" x for X U {x}. Given y G R d and 
a G R, we set y + ax := (y + ax,t). Let y + aA' := {(y + aw) : w G A"}; in other words, scalar 
multiplication and translation act only on the first component of elements of K d x M. 
For A C R d , we set y + aA = {Y + aw : w G A}. We say £ is translation invariant if 

Z(x;X 1 A) = ay + x;y + x,y + A) 

for all y G R d , all finite X C R d x M and x G AT and all Borcl A C R d . 

Let k be a probability density function on R d . Abusing notation slightly, we also let k 
denote the corresponding probability measure on R d , that is, we write k(A) for J A k(x) dx, 
for Borel A C R d . For all A > 0, let Ak denote the measure on R d with density Ak(-) and 
let V\ denote a Poisson point process in R d x M with intensity measure Ak x . 

Let (X,T), (X',T ; ), (Xi,Ti), (X 2 ,T 2 ), . . . denote a sequence of independent identically 
distributed random elements of R d x A4 with distribution k x /!» and set X := (X, T), 
X' := (X', T") and Xj := (X ; , I 1 ;), i > 1. For n G N, let X n be the point process in R d x M 
given by A" n := {Xi, X 2 , . . . , X„}. Let Tt\ denote a Poisson point process in R d x A4 
with intensity A times the product of <i-dimcnsional Lebcsguc measure and \im (i-c, 
a homogeneous marked Poisson process in R d with intensity A) and let 7i\ denote an 
independent copy of H\. 

Suppose we are given a family of non-empty open subsets fl\ of R d , indexed by A > 1, 
that are non-decreasing in A, that is, satisfying fl\ C Q\, for A < A'. Denote by Qao the 
limiting set, that is, set fl^ := Ua>i Suppose we are given a further Borel set fl with 
floo C C R d . In many examples, one takes £7a = for all A, either with f2 = R d or with 
k supported by f2. 

For A > and for finite X C R d x M with x = (x, t) G X and Borcl A C R d , let 



Ca(x; A", A) := £(x; z + A x / d (-z + A"),x + A 1 /^-^ + A))1 0a (x). (2.1) 
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When £ is translation invariant, the rescaled measure £\ simplifies to 

G(x; X, A) = CCA^x; \^ d X, X^A)!^ x M (x). (2.2) 

In general, the point process x + A 1//d (— x + X) is obtained by a dilation, centred at x, of 
the original point process. Loosely speaking, this dilation has the effect of reducing the 
density of points by a factor of A. Thus, for x = (x,t) G M. d x A4, the rescaled measure 
£\(yi;X,A) is the original measure £ at x relative to the image of the point process X 
under a dilation about x, acting on the image of 'space' (i.e., the set A) under the same 
dilation. This 'dilation of space' has the effect of concentrating the measure near to x; 
for example, if £(x; X) is a unit point mass at x + y{x) for some measurable choice of 
function x ^ y(x) G M. d and £l\ = M. d , then £\(x,t;X) would be a unit point mass at 
x + \- 1 / d y{x). 

Our principal objects of interest are the random measures v\. n on R d , defined for 
A > and n G N by v\_ n := Y27=i £aP^»> %n). We study these measures via their action 
on test functions in the space B{Vl) of bounded Borel measurable functions on fl. We let 
B(il) denote the subclass of B(il) consisting of those functions that are Lebesgue-almost 
everywhere continuous. When ^ M. d , we extend functions / G B(fl) to R d by setting 
f(x) = for xeR d \fl. Given f e B(Cl), set (/,&(x;Af)) := J Rd /(z)6(x; X, dz). Also, 
set 

~ n 

(f,V\,n) ■= / fdv Kn = y2(f,&(Xi\ #„)>. (2.3) 

The indicator function ln A (x) in the definition (2.1) of £a means that only points 
Xj contribute to i^\,n- In most examples, the sets fl\ are all the same and 

often are all M. d . However, there are cases where moment conditions such as (2.5) below 
hold for a sequence of sets ft\, but would not hold if we were to take = il for all 
A; see, for example, [15]. Likewise, in some examples, the measure £(x; X) is not finite 
on the whole of K d , but is well behaved on fi, hence the restriction of attention to test 
functions in B(Q). 

Let | • | denote the Euclidean norm on K d and for x G M. d and r > 0, define the ball 
B r (x) := {y G M d : \y — x\ < r}. We denote by the origin of R d and abbreviate B r (0) 
to B r . We write B*(x) for B r (x) x M, B* for B r x M and {B*) c for (R d \ B r ) x M. 
Finally, we let ujd denote the Lebesgue measure of the d-dimensional unit ball B\ . 

We say a set X C M. d x M is locally finite if X n B* is finite for all finite r. For 
x G K d x M and Borel A C R d , we extend the definition of £(x; <Y, A) to locally finite 
infinite point sets X by setting 

£(x; X, A) := limsup£(x; XDB* Kl A). 

K->oo 

Also, for x = (x,t) G M. d x A4, we define the a;-shifted version •) of £(x; •, •) by 

(*,A)=£(x;.t + A> + A). 
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Note that if £ is translation invariant, then ^'^(XjA) = (X, A) for all x € R d , 
i G Al and Borcl A C R d . 

The following notion of stabilization is similar to those used in [2, 18]. 

Definition 2.1. For any locally finite X C R d x M. and any x = (x,t) £ R d x A4 , define 
R(x.]X) (the radius of stabilization o/£ ai x with respect to X) to be the smallest integer- 
valued r such that r > and 

f (x; x + [X n U y, fl) = £(x; x + [Af PI S*], B) 

for all finite y C (B*) c and Borel B C R d . J/ no suc/i r exists, we set i?(x; AQ = oo. 

In the case where £ is translation invariant, R((x,t): X) = R((0,t); X), so R((x,t);X) 
does not depend on x. Of particular importance to us will be radii of stabilization with 
respect to the homogeneous Poisson processes H\ . 

We assert that i?(x; X) is a measurable function of X and hence, when X is a random 
point set such as H\, i?(x; X) is an NU{oo}-valued random variable. To see this assertion, 
observe that by Dynkin's pi-lambda lemma, for any k € N, the event {i?(x; X) < k} equals 
the event ClBeB^i^^ = i{X, B)}, where B is the II-system consisting of the rectilinear 
hypercubes in R d whose corners have rational coordinates and for B S £>, we set 

s(X, B) := sup{£(x; x + ([X n Sj] U y), B) : y C (B* k ) c } 7 
i(X, B) := inf{£(x; x + ([X n BJ] U y), B) : y C (B,*) c }. 

Also, s(A",_B) is a measurable function of ^ because we assume £ is Borel measurable 
and, for any b, we have 

{X :s(X)>b} = 7T! ({(#, y) : £(x; x + [X C\ B%] U [y \ Bl\,B) > 6}), 

where 7Ti denotes projection onto the first component, acting on pairs (X,y), with <Y 
and y finite sets in R d x A(. Similarly, i(X,B) is a measurable function of X. 

For x = (i,()£l l! xM, let £J(x; A 7 , •) be the point measure at x with total measure 
Ca(x; Af, 0), that is, for Borel A C R d , let 

^(x;^,A):=eA(x;Af,n)l A (x). (2.4) 

We consider measures £ and test functions / <E S(0) satisfying one of the following 
assumptions. 

Al: £((x,t);X, •) is a point mass at x for all (x, t, X). 

A2: £(x; X,-) is absolutely continuous with respect to Lebesgue measure on R d , with 
Radon-Nikodym derivative denoted £,'{x\X,y) for y £R d , satisfying (x; X , y) < 
Kq for all (x; X,y), where Kq is a finite positive constant. 

A3: / is almost everywhere continuous, that is, / 6 B(Q). 
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Note that assumption Al implies that £J = £\> an d that assumption A2 will hold if 
£(x, X, •) is Lebesgue measure on some random subset of M. d determined by x, X. 

Our first general result is a weak law of large numbers for (/, P\ n ) defined at (2.3), for 
/ £ B(fl). This extends [18], which is concerned only with the case where / is a constant. 
We require almost surely finite radii of stabilization with respect to homogeneous Poisson 
processes, along with a moments condition. 

Theorem 2.1. Suppose that R((x,T);7{ K r x \) is almost surely finite for K-almost all 
x G rioo- Suppose, also, that f G B(f2) and that one or more of assumptions Al, A2, 
A3 holds. Let q~l or q = 2 and let the sequence (A(n),n > 1) be a sequence of positive 
numbers with X(n)/n — ► 1 as n — > oo. If there exists p> q such that 

limsupE[£ A („)(X; X n - U n) p ] < oo, (2.5) 
then, as n — »■ oo, we have the L q convergence 

n- x {f,v X (n),n)^ I f(x)nct T \n K{x) ,R d )Mx)dx as^ M , (2.6) 
with finite limit, and the L 1 convergence 

n 

n- 1 ^|(/,e A(n) (X ! ;4)-e' (n) (X 4 ;4))hO asn^oo. (2.7) 

i=l 

To extend Theorem 2.1 to a strong law, we need to assume extra conditions concerning 
the so-called add one cost, that is, the effect of adding a single further point on the 
measure V\(n),n-x- We define three different types of add one cost, the first two of which 
refer to a test function /. Given / G B(f2) and n > 2, A > 1, set A>, ; „(/) := (f,v\, n — 
v\, n -i)- Also, let G\x,n(/) be the sum in (2.7), that is, set 

n 
i=l 

and set A' Xn (f) ■= G\,n(f) ~ Ga,»-i(/). Furthermore, for 1 < i < n- 1, let |6,(X i; #„) - 
£a(X;; Af n _i)|(0) denote the total variation (i.e., the sum of positive and negative parts) 
of the signed measure £\(Xj; X n , •) — £a(X^; Af„_x, •) on Q and define A>, n by 

Tl-l 

A A ,„ :=eA(X„;A , „,n) + ^|eA(X i ;-Y„)-a(X i ;A , „_ 1 )|(Q). (2.8) 

i=l 

Given a random variable W, as usual, we let \\W\\ P := E[|lT| p ] 1//p for p = 1 and ||W||oo := 
mf{t > : P[|W| > t] = 0} with inf (0) := +oo. 
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Theorem 2.2. Suppose that R((x,T);7i K t x \) is almost surely finite for n-almost all 
x £ f2oo- Suppose, also, that f G B(tt) and that one or more of assumptions Al, A2, A3 
holds. Suppose X(n)/n — ► 1 as n — > oo and £/ia£ £/iere exists p > 1 ,suc/j £/ia£ (<?.5,) holds. 
Suppose (3 > 1 and p' > 2(/3 + 1). //, as n — > oo, we /iaue 

I|Aa ( „),„(/)IL=0(tA ||A A(B) , n (/)|| p ,=0(l), (2.9) 
£/ien f ao/ds ii;i£/i almost sure convergence. If, instead of (2.9), we have 

K (n ), n (/)IL = <V), ||A A(n))n (/)|| p ,=0(l), (2.10) 
£/ien (^.Tj ZioZds u>z£/i almost sure convergence. Finally, if 

II A A( „),„ |L=0(tA ||A A( „ )jn || p ,=0(l), (2.11) 
£/ien 6o£/i f^-^ and (2.10) hold, so both (2.6) and (2.7) hold with almost sure convergence. 

Remarks. Certain weak laws of large numbers for {/, ^xfn) n) f°U ow directly from The- 
orem 2.4 of Baryshnikov and Yukich [2]. However, the conditions in Theorem 2.1 arc 
weaker in many ways than those in [2], as one might expect, since we consider only the 
law of large numbers, whereas [2] is concerned with Gaussian limits. 

For example, in [2], attention is restricted to cases where assumption Al holds. It is 
often natural to drop this restriction. Also, in [2], it is assumed that re has compact convex 
support and is continuous on its support, whereas we make no assumptions here on re. 
Moreover, in [2], attention is restricted to continuous bounded test functions /, whereas 
we consider test functions which are merely bounded (under Al or A2) or bounded and 
almost everywhere continuous (under A3). Thus, we can consider test functions which 
are indicator functions of Borel sets A in Q. 

Our stabilization conditions refer only to homogeneous Poisson processes, and not to 
any non-homogencous Poisson processes, as in [2]; unlike that paper, we require only 
that radii of stabilization be almost surely finite, with no condition on their tails. Also, 
our moments condition (2.5) is simpler than the corresponding condition in [2] (equation 
(2.2) of [2]). 

Almost sure convergence, that is, the strong law of large numbers, is not addressed 
in [2] or [18]. Some strong laws for graphs arising in geometric probability are derived 
by Jimenez and Yukich [9] and we add to these. As in [9], we actually prove complete 
convergence, as defined in, for example, [13] or [17]. 

Unlike [2, 9, 18], we spell out the statement and proof of our law of large numbers for 
marked point processes (i.e., point processes in M. d x Ai, rather than in M. d ). This setting 
includes many interesting examples, such as germ-grain models and on-line packing, and 
generalizes the unmarked point process setting because we can always take M. to have a 
single element and then identify M. d x Ai with R d to recover results for unmarked point 
processes from the general results for marked point processes. 
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Poisson samples. It is also of interest to obtain a similar result to Theorem 2.1 for 
the random measure fi\ defined in a similar manner to v\. ni but using the Poisson point 
process V\ instead of X n , taking A — > oo. Such a result can indeed be obtained by a similar 
proof, with L 1 convergence if, instead of the moments condition (2.5), one assumes 

supE[^(X;7'A, ^) p ] < oo, for some p>l. 
\>i 

Multisample statistics. Suppose that X ni C K d represents a sample of m points of 'type 
one' and y n2 C M. d represents an independent sample of n-j points of 'type two,' possibly 
having a different underlying density function. Suppose that for i = 1,2, the function- 
al £W (x; X, y) and £^ 2 \y;X,y) are defined in a translation-invariant and stabilizing 
manner for finite X C R d , finite y C K d , x £ X and y £y. When the integers rii = 
(i = 1,2) satisfy ni(n) + n 2 (n) = n and rii(n)/n — > 7Tj G (0,1) as n — > oo, the sum 

£ (" 1/d ^ ; , n 1 /^ ) + g f (a) (n V^. . n i/^ , n V^ na ) 

»=1 3=1 

satisfies an LLN under weaker (or at least, different) conditions than those for the main 
result of Hcnzc and Voigt ([8], Theorem 2.3) for such sums (and likewise for more than 
two samples). The proof is based on the techniques of this paper and the results are 
comparable to those for independently marked points where M. = {1,2} and the 'mark' 
determines whether a point is in sample X or y . We omit the details here. 



3. Weak convergence and the objective method 

In this section, we derive certain weak convergence results (Lemmas 3.3-3.6). We use a 
version of the 'objective method' [1, 21], whereby convergence in distribution (denoted 

— >) for a functional defined on a sequence of finite probabilistic objects (in this case, 
rescaled marked point processes) is established by showing that these probabilistic objects 
themselves converge in distribution to an infinite probabilistic object (in this case, a 
homogeneous marked Poisson process) and that the functional of interest is continuous. 

A point process in R d x M. is an £- valued random variable, where C denotes the space 
of locally finite subsets of M d x M.. We use the following metric on C: 

D(A,A') = (nmx{K eN:An B* K = A' D B* K }y\ (3.1) 

With this metric, £ is a metric space which is complete but not separable. In the un- 
marked case where M has a single element, our choice of metric is not the same as 
the metric used in Section 5.3 of [21]. Indeed, for one-point unmarked sets, our metric 
generates the discrete topology rather than the Euclidean topology. 

Recall (see, e.g., [13], [19]) that x G K d is a Lebesgue point of k if e~ d J B , > \n{y) — 
n(x)\dy tends to zero as e | and that the Lebesgue Density Theorem tells us that 
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almost every i £ R is a Lebesgue point of k. For subsequent results, it is useful to define 
the region 

fio : = { x € ^oo : k(x) > 0, x a Lebesgue point of «(•)}• (3.2) 



Lemma 3.1. Suppose x G fio <w&rf suppose (y(A), A > 0) is an M. d -valued function with 
\y(X) — x\ = 0(A~ 1 / d ) as A — ► oo. Then there exist coupled realizations V' x and 'H' k m of 
V\ andTi. K r x \, respectively, such that 

D{X l l d {-y{X)+V x ),H' K{x) )^0 as A —>• oo. (3.3) 

Proof. Let H + denote a homogeneous Poisson process of unit intensity in WL d x M. x 
[0, oo). Let V' x denote the image of the restriction of TL + to the set {(w, t, s) G M. d x A4 x 
[0,oo):s < Xn(w)} under the mapping (w,t,s) *—> (w,t). Let 'H l K r x y denote the image 
of the restriction of Ti + to the set {(w,t,s) G M d x M. x [0, oo) :s < Xk(x)} under the 
mapping 

(w,t,s)^(X 1 / d (w-y(X)),s). 

By the Mapping Theorem [12], V' x has the same distribution as V\, while H', x -, has the 
same distribution as Tt K ( x )- 

The number of points of the point set 

{X 1 l d {-y{X)+V' x )AH' K(x) )nB* K 

equals the number of points (X,T,S) of Ti + with X G B x -i/ d K(y(X)) and with either 
Xk(x) < S < \k(X) or Xk(X) < S < Xk(x). This is Poisson distributed with mean 

A / \ K ( Z ) ~ K ( x )\ dz, 

JB x _ 1/dK (y{X)) 

which tends to zero because x is assumed to be a Lebesgue point of k, and (3.3) follows. □ 

In the rest of this section, given x G we write x for (x,T) (i.e., for the point x 
equipped with a generic random mark T) and, given y G R d , we write y for (y,T"). 

Lemma 3.2. Suppose (x,y) £ flo x Qq with i / j. Let (A(fc), £{k), m(k))k£K be 
a ((0,oo) x N x N)-valued sequence satisfying X(k) — > oo, and i(k)/X{k) — > 1 and 
m{k)/X{k) — ► 1 as fc — > oo. Then, as k — > oo, 

(A(fc) 1 / d (-^ + A4 (fc) ), A(£;) 1/d (-x + ^„ (fe) ), A(fc) 1 / d (-y + * m(fc) ), 
A(fc) 1/d (-^ + A* fc) ), A(fc)^(-x + ** (fc) ), X{k) 1 / d {-y + A:, x n(fc) )) 

K^«(x)! H K (y), H K (y))- (3-4) 
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Proof. In this proof, we write simply A for A(fc), £ for £(k) and m for m(k). We use the 
following coupling. Suppose we are given A. On a suitable probability space, let V and 

V be independent copies of V\, independent of Xi,X 2 , — 

Let V be the point process in M. d x M. consisting of those points (V, T) 6 V such that 
\V-x\ < \V-y\, together with those points {V',U') G V with \ V - y\ < \ V - x\. Clearly 

V I is a Poisson process of intensity Xk x onK d xA<. 

Let T~C' K ( x j and be independent copies of Fi K [ x ) and Ti-^y) > respectively. Assume 

W K r x -) an d H' K ( y ) are independent of (Xi, X2, X3, . . .). Using Lemma 3.1, assume, also, 
that Ti' K t x \ is coupled to V and H' K ( y -) is coupled to V in such a way that as k — > 00, 

max(£>(K (l) , A 1 ^-* + V)), D(H' K{y) , X^-y + £>))) -^0. (3.5) 

Let TV denote the number of points of V (a Poisson variable with mean A). Choose an 
ordering on the points of V' , uniformly at random from all TV! possible such orderings. Use 
this ordering to list the points of V' as Wi, W2, . . . , Wjv". Also, set Wjy+i = Xi, Wjv+2 = 
X2, Wjv+3 = X3 and so on. Set 

X[ := {Wi, . . . , W,}, X' m := {Wi, . . . , W m }. 

Then {X' t , X^) = (X i; X m ) and {W K ^ x yU' K ^ y) ) = (H K ( x ),'H K (y)), where = denotes equality 
of distribution. 

Let K £ N and let 8 > 0. Define the events 

E := {X' m n B* x _ 1/dK (x) =V'n Bl_ 1/iK (x)}, 
F := {(X^i-x + V)) nB* K = H' K(X) n B* K }. 

Event E occurs unless either one or more of the (TV — m) + "discarded" points of V' or 
one or more of the (m — TV) + "added" points of {Xi, X2, . . .} lies in B x _ l/dR (x). For each 
added or discarded point, for sufficiently large A, the probability of lying in B* x _ 1/dK (x) 
is at most ujd(n(x) + l)K d /X because x is a Lebesgue point. Thus, for k sufficiently large 
that |m — A| < 5X, we have 

P[E C ] <P[\N-X\> SX] + (2SX)uj d (K(x) + l)K d /X, 

which is less than Z5lo,i(k(x) + l)K d for sufficiently large k. Hence, P[E C ] — > as k — > 00. 
Moreover, by (3.5), we also have P[_F C ] — > as k — * 00. 

Assuming A to be so large that \x — y\ > 2X~ 1 ^ d K, if E n F occurs, then 

< (x) nB* K = (x 1 / d (-x+v))nB* K 

= x^ d ((~x + v)n BU, dK ) = x^i-x + (rn B* x _ 1/dK (x))) 

= X^ d (-x + (X^nBl_ 1/dK (x))) 

= x^ d ((-x + x' m ) n BU /dK ) = (\ 1/d (~x + x!J) n b* k 
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so that D(H' K ,,, \ 1/d {-x + X' m )) < l/K. Hence, for any K, we have 
P[D{K {x) XI\-x + X' m )) > 1/JfJ-O. 

Similarly, we have 

m^{P[D(H' K{x) ,\ 1/d (-x + K)) > l/K],P[D{H' K(y) ,\ 1/d (-y + ^)) > l/K], 

P[D(K {x) ,X^ d (-x + (Xtf)) > l/K],P[D(n' K{x) ,\ 1/d (-x + {X' m f)) > l/K], 

P[D{H' K{y) X ,d {-y + {x' m r)) > l/J?]}-0. 

Combining these, we have the required convergence in distribution. □ 

Lemma 3.3. Suppose (x,y) ££Iq x £Iq, with x^y. Suppose, also, that R(x;H K ^) and 
R(y;7i K r y )) are almost surely finite. Suppose (A(m)) m >i is a (0,oo)-valued sequence with 
A(m)/m — ► 1 as m — > oo. Then, for Borel A C M. d , as m — > oo, we /iaue 

£ A(m) (x; * m , z + A(m)~ 1/d A) (W„ (!B) , A) (3.6) 

and 

(( AW (x;^,i|AH- 1 ^) 1 ( AW (y;^,! / + A(m)- 1 ^)) 

■^(fio(W»W.>l) 1 S,(«,(v)^))- ( 3 - 7 ) 

Proof. Given A, define the mapping /i^ ja . : Ai x £ — > [0, oo] and the mapping h\ : (Ai x 
£ x M x £) -> [0, oo] 2 by 

h A , x (t,X)=Z({x,t);x + X,x + A), 
h 2 A (t,X,t',X') = (h AtX (t,X),h A , y {t',X')). 

Since i?(x;7Y K ( z .)) < oo a.s., the pair iT,TH. K r x )) lies a.s. at a continuity point of h AtX , 
where the topology on Ai x £ is the product of the discrete topology on Ai and the 
topology induced by our metric D on £, defined at (3.1). Similarly, (T, 7i. K ( x ), T", Ti K ^ y j) 
lies a.s. at a continuity point of h\. We have, by the definition of that 

^(xjA^.s + A- 1 /^) 

= /iAx(T,A 1/d (-a; + A' m )), 

(fc(x; ^ , x + A- 1 /^), 6(y; ,J/ + A" 1 ^)) 
= /^(T, AV^-a; + X? n ),T' , \V*(-y + X*)). 

By Lemma 3.2, we have (T, A(m) 1 / d (-x + Af ro )) -^(T, H k{x) ) so that (3.6) follows by 
the Continuous Mapping Theorem ([3], Chapter 1, Theorem 5.1). Also, by Lemma 3.2, 

(T,A(m)^(- x + ^) J T' ) A(m)^(-y + ^))^(T,W B(a) ,T , ,W /((tf) ) 
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Lemma 3.4. Suppose (x,y) £ flo x with x ^ y, that R(x;Tl K ^) , i?(y;H K ( 2/ )) ; 
£oo(^K(»j^ d ) an d ^("HnMi^- ) are almost surely finite and that (A(m)) m >i is a(0,oo)- 
valued sequence with A(m)/m — ► 1 as m — > oo. Then, as m -> oo, we /iave 

&(m) (x; A- m , fi) («!.(*) - « d ) (3-8) 

and 

(CAMfrA^fiUAMty;**,^ (3.9) 

Proof. Since O C fi^, C and fioo is open, for any K > 0, we have, for sufficiently 
large m, that 

0<6,( m )(x;* m ,IR d \r!) 

<ex( m )(x;A^,R ,i \B A(mr i/ < , x (a;))-^^(W B(a)j M ,i \BA-) ) (3.10) 

where the convergence follows from (3.6). By assumption, ^ T (W li ( x ),R' i ) is almost surely 
finite, so the limit in (3.10) itself tends to zero in probability as K — ► oo and therefore 

£A(m)(x; X m ,M. d \ fi) -^>0 as A — > oo. Combining this with the case A = M. d of (3.6) and 
using Slutsky's theorem (see, e.g., [13]), we obtain (3.8). 

A similar argument to the above, using (3.7), shows that as m — > oo, 

(&(m) (x; M d \ n), e A(m) (y; , R d \ fi)) -^(0, 0) 

and by using this with the case A = R d of (3.7) and Slutsky's theorem in two dimensions, 
we obtain (3.9). □ 

The next lemma compares the measure £a(x; X , •) to the corresponding point measure 
£?(x;Af, ■). In proving this, for / £ -B(fi), we write ||/||oo for sup{|/(x)| : x £ fi}. 

Lemma 3.5. Let x £ fio o/nd suppose that R(pc;T-C K t x \) and ^(7i K ( x ),R ) are almost 
surely finite. Let y £ M. d , with y ^ x. Suppose that f £ B(fi) and suppose either that f is 
continuous at x or that assumption A2 holds and x is a Lebesgue point of f . Suppose 
(A(m)) m >i is a (0, oo) -valued sequence with A(m) / 'm — > 1 as m — > oo. Then, asm— >oo, 

(/, £am (x; * m ) - £ (m) (x; * m )} (3.11) 

and 
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Proof. In this proof, we write A for A(m). The left-hand side of (3.11) is equal to 

(f(z)-f(x))£ x (x;X m ,dz). (3.13) 



Given K > 0, we split the region of integration in (3.13) into the complementary regions 
B x -i/d K (x) and M. d \ B x -i/d K (x). Consider the latter region first. By (3.6), we have 



l \B x -i, dK (*) 



(/(*)- /(z))6(x;* m ,d2) 



<2| 



^ x (^X m ,R d \B x - l/dK (x)) 



[ \B K ), 



where the limit is almost surely finite and converges in probability to zero as K — > oo. 
Hence for e > 0, we have 



lim lim sup P 



K d \B A -i/d R .(^) 



(f(z)-f(x))^;Xm,dz) 



> £ 



(3.14) 



Turning to the integral over B x -i/d K (x), we consider separately the case where / is 
continuous at x and the case where A2 holds and x is a Lebesgue point of /. To deal 
with the first of these cases, writing 4> e (x) for sup{|/(y) — f(x)\ :y G B £ (x)}, we observe 
that 



B x _ 1/dK (x) 



(f(z)-f(x))Z x (x;X m ,dz) 



< 4> x -i/d K (z)£a(x; X m ,fl). 



(3.15) 



If / is continuous at x, then <j) x -i/d K {x) — > 0, while £a( x ! <Y m , fl) converges in distribution 
to the finite random variable ^(Ti. K [ x ),l& d ), by (3.8), and hence the right-hand side of 
(3.15) tends to zero in probability as to — > oo. Combined with (3.14), this gives us (3.11) 
in the case where / is continuous at x. 

Under assumption A2, for Borel A C M. d , the change of variables z = x 
yields 



&(x;*,A) 



e^;x + X^ d (-x + X),y)dy 

x+\ 1 / d (-x+A) 

A / e'(x;x + A 1/d (-x + X) 1 x + \ 1 / d {z-x))dz. 



Hence, under A2, 



B x -i,d K (*) 

= A 



(f(z)-f(x))^(x;X m ,dz) 

(f(z) - (x; x + X 1/d (-x + X m ),x + X 1 / d (z - x)) d; 
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<K X \f(z)-f(x)\dz 

JB x _ 1/dK (x) 

and if, additionally, a; is a Lebesgue point of /, then this tends to zero. Combined with 
(3.14), this gives us (3.11) in the case where A2 holds and x is a Lebesgue point of /. 
The proof of (3.12) is similar; we use (3.7) and (3.9) instead of (3.6) and (3.8). □ 

By combining Lemmas 3.3 and 3.5, we obtain the following, which is the main ingre- 
dient in our proof of the Law of Large Numbers in Theorem 2.1. 

Lemma 3.6. Suppose (x, y) £ Qq x Qq, with x^y. Suppose, also, that i?(x; 'H K ,( X )), R(y, 

"^k(s/))> ^,x(^-K(x)^ d ) an d £%o('H-K(y)i^ d ) are almost surely finite. Let f S B(£l) and sup- 
pose either that Al holds, that A2 holds and x is a Lebesgue point of f , or that f is 
continuous at x. Suppose (A(m)) m >i is a (0,oo)-valued sequence with X(m)/m^ 1 as 
m — > oo. Then, as m — > oo, 

</, £a M (x; X m ))^ /(*)£, (H K{X) , R d ) (3.16) 

and 

(/,eAM(x;^))(/,eA( m) (y;^)) 

/(*)/(»)£, («„(*) , R d )^ (H K{y) , R d ). (3.17) 
Proof. Note, first, that by (3.8), 

(/, ei (m) (x; X m )) = f(x)Z Hm) (x; X m> fi) f(x)&(H <x) , R d ) (3.18) 
and similarly, by (3.9), 

((/,ei (m) (x;^)),(/,^ (m) (y;^))) 

^(/W^(W K(;c ),IR rf ),/(2/)^(H KCy) ,R d )). (3.19) 

In the case where Al holds, we have £a = £a> so (3-16) follows immediately from (3.18) 
and (3.17) follows immediately from (3.19). 

In the other two cases described, we have (3.11), by Lemma 3.5. Combining this with 
(3.18), we see, by Slutsky's theorem, that (3.16) still holds in the other two cases. Sim- 
ilarly, by (3.19), (3.12) and Slutsky's theorem, we can obtain (3.17) in the other cases 
too. □ 



4. Proof of general laws of large numbers 

In this section, we complete the proofs of Theorems 2.1 and 2.2, using the weak con- 
vergence results from the preceding section. Throughout this section, we assume that 
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(X(n)) n >i satisfy A(n) > and that X(n)/n — ► 1 as n — > oo. Also, let H K (x) denote a 
Cox point process in x M, whose distribution, given X = x, is that of Ti. K / x > (where 
X = (X, T) is as in Section 2). We first show that the conditions of Theorem 2.1 imply 
that t£' T) (H K ( x) ,R d ) is finite. 

Lemma 4.1. Suppose M.((x,T);7i K ^) < oo almost surely, for K-almost all x £ f2oo. // 
(2.5) holds for p = 1, then Coo' T ^ CW re ( x ) , is almost surely finite, for K-almost all x £ 

Proof. Given K > 0, define the random variables 

Sk ■= Cm(W k (x), B k )1q oo (X), Sk,u :=Ca(«)(X; X n ~i, B x ^-i/ dK (X)). 

By Lemma 3.3, for any bounded continuous test function h on R, as n — ► oo, we have 
almost sure convergence of E[/i(S'k ! „)|X] to E[H(Sk )|X]. By taking expectations and 
using the dominated convergence theorem, we have that E[h(SK,n)] — ► E[/i(Sa')]- Hence, 

SW.n — >-<Sk as n — > oo. Hence, by (2.5) and Fatou's Lemma, E[£V] is bounded by a 
constant independent of K. Taking K — > oo, we may deduce that ^(H K (x),IR !i )l J2=o (X) 
has finite mean and so is almost surely finite. The result follows. □ 

To prove Theorem 2.1, we shall use the following general expressions for the first two 
moments of (f,vx,n)- By (2.3), we have 

n- 1 E(/, ux, n ) = E(/, £a(X; X*-i)) (4.1) 

and 

n- 2 Var (/, ^ (n) J = n^E^f, f A(n) (X; A",^)) 2 ] 

+ E[(/, £ A(ri) (X; (/, £ A(n) (X'; 2 )>] 

-(E[{f^ Hn) (X;X n ^))}f. (4.2) 

Recall that by definition (2.1), £ x ((x,t); X ,R d ) = for x £ R d \ ft A , with (Q A , A > 1) a 
given non-decreasing family of Borel subsets of M. d with limit set fioo and f^oo Cj]CK d . 
In the simplest case, fi A = R d for all A. 

Proof of Theorem 2.1. Let / £ £>(S1). First, we prove (i) for the case q = 2. Assume 
that (2.5) holds for some p > 2. Set J := /(A)^(W K(x) ,R d )l noo (X) and let J' be an 
independent copy of J. By Lemma 4.1, J is almost surely finite. 

For any bounded continuous test function h on R, by (3.16) from Lemma 3.6, as m 
oo, we have E[/i((/,£ A („)(X; ,Y„-i)))|X] — > E[J|X], almost surely. Hence, E[h((f, £ A ( n )(X; 
Af„_i)))] ->E[/i(J)j so that 



(/) ^A(n)(X;Af n _i)) — > J. 



(4.3) 
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Similarly, using (3.17), we obtain 

(/,^ ( „ ) (X;-Y„ x : 2 ))(/,^(n)(X / ;A'^ 2 )} j'j. (4.4) 

Also, by (2.5) and the Cauchy-Schwarz inequality, the variables in the left-hand side of 
(4.3) and in the left-hand side of (4.4) are uniformly integrable, so we have convergence 
of means in both cases. Also, (2.5) shows that the first term in the right-hand side of 
(4.2) tends to zero. Hence, we find that the expression (4.2) tends to zero. Moreover, by 
(4.1) and the convergence of expectations corresponding to (4.3), n _1 E(/, v\( n ), n ) tends 
to E[J] and this gives us (2.6) with I? convergence, 

Now, consider the case q — 1. Assume (2.5) holds for some p > 1. First, assume / is 
non-negative. We use a truncation argument; for K > 0, let be the truncated version 
of the measure , defined by 

£f ((x, t);X, A) := min(£ A ((x, t); X, fl),K)l A (x). 

Let fl* be the set of x £ fio such that R((x,T);TC K ( x -)) and Coo(H K (a;):K d ) are almost 
surely finite. By Lemma 4.1, re(f2o \ f2*) = 0. 
Then, for x = (x,T) with x £ f2*, 

(/, g (n) (x; *„-i)} = f(x) min(£ A (x; X n - U 0), X) 

^/(x)min(^(H K(;c) ,R d ),X), (4-5) 

where the convergence follows from (3.8). Similarly, for distinct x,y in fT, setting x = 
(x,T) and y = (y,T'), by (3.9), we have that 

(/,^„ ) (x;^_ 2 ))(/,^„ ) (y;^ x _ 2 )) 

/ (x)/(y) min(£* (W„ (x) , K d ) , if ) min(£, (« b(j0 , M d ) , if) . (4.6) 

Using (4.5) and the same argument as for (4.3), we may deduce that 

{f,^ {n) (X;X n ^))^J K , (4.7) 

where we set 

J K :=f(X)mm(^(n K{x) ,m d ),K)l n (X). 
Likewise, using (4.6), we obtain 

(f,tf(X;X* 2 ))(f,g(X';X* 2 ))^J K J K , (4.8) 

where J' K is an independent copy of Jk- Also, since £f (-x.;X,n) is bounded by K, the 
distributional convergences (4.7) and (4.8) are of bounded variables, so the corresponding 
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convergence of expectations holds. Set 

n n 

»X,n :=J^f (X,;^), :=E^(X l5 *„). 

i=l i=l 

By following the proof of (2.6) with L 2 convergence, we obtain 

»~ 1 </,^{n) I n>-^EJ J 



K ■ 



(4.9) 



Also, 



0<E[n- 1 (/ J ^ (n)in >-n- 1 (/,i^ B)in )] 
= E[(/, C Kn) (X; X n -s) - £f (n) (X; *„_i))] 

< II/IIooIKaw (X; n)i{ex ( „) (x ; *„-i, n) > k}\, 

which tends to zero as K — - > oo, uniformly in n, because the moments condition (2.5), 
p > 1, implies that the random variables (X; X n -i, arc uniformly integrablc. Also, 
by monotone convergence, as K — > oo, the right-hand side of (4.9) converges to E[J]. 
Hence, taking K — > oo in (4.9) yields 



n _1 (/.^(n),n>-^E[J] 



(4.10) 



This gives us (2.6) with L 1 convergence when assumption Al holds, in the case where 
/ is non-negative; by taking positive and negative parts of / and using linearity, we can 
extend this to general /. 

Now, suppose A2 or A3 holds. Then, 



E 



1 JZ K/'^A(n)(x»;^) -C\( ra )( x i; x n )}\ 

i=l 

E[|(/,e A ( ri )(X;A'„_ 1 )-^ (n) (X;Af„_ 1 ))|]. 



(4.11) 



By (3.11), the variables |(/, £^( n )(X; X n -\) — £w„)(X; -f n _i))| tend to zero in probability 
and, by (2.5), they arc uniformly integrablc, so their mean tends to zero, that is, the 
expression (4.11) tends to zero and thus we have (2.7). Combining this with (4.10) gives 
us (2.6) for q = 1 when assumption A2 or A3 holds, completing the proof. □ 



Proof of Theorem 2.2. Suppose that R((x,T);H. K r x j) is almost surely finite for k- 
almost all x G 51oo- Suppose that \{n)/n — ► 1 as n — > oo and that there exists p > 1 such 
that (2.5) holds. By the case p = 1 of Theorem 2.1 (or, more directly, by the argument 
at the start of the proof of that result), we have convergence of means in (2.6). To derive 
almost sure convergence under condition (2.9), we loosely follow the argument from [17], 
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pages 298-299. For A > 0, define H x :\J™ =1 [(R d x M) r 
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by 



£/a(xi, . . . ,x„) := ^ {xi, ■ ■ • ,x„})}. 



Then, (/, = H\(K\, . . . , X„). Let Ti denote the a-field generated by Xi, . . . , X^ and 
let To denote the trivial cr-field. We then have the martingale difference representation 
(f,v\(n),n) -E(/,i/A(„),n) = E"=i d n where d t := E[(/, v X ( n ^ n )\Ti\ - ^A(n),n>l^i-l]- 
Notice that 

rfi = E[i/\(„) (Xi, . . . , X n ) — Hx( n ) (Xl, . . . , Xj_i, X', Xj+i, . . . , X„) J^]. 

By assumption (2.9), ||i? A („)(Xi, . . .,X„) —H X ( n )(Xi, . . .,X n _i)|| p / is bounded by a con- 
stant C, independent of n, and so, by Minkowski's inequality and exchangeability of 
Xi, . . . , X„, 

H#A(n)(Xi, . . . , X„) - H\(n)(Kl, . . . ,Xj_i, X ,X,+1, . . . ,X„)|| p , 

< 2 ||-ffA(n)(Xi, . . . ,X„) - H X ( n }(Xi, . . . ,X n _i)|| p , < 2C, 

so, by the conditional Jensen inequality, allowing the constant C to change from line to 
line, we have 



EKK < EE[|ff A(n) (X 1 , . . . , Xn) - #A(n)(Xl, ■ • • , X^, X', X i+1 , . . . ,X n )| p ' I Ti\ 



<c. 



(4.12) 



Choose 7 to satisfy 7 < 1/2 and p'7 > f3 + 1. By the condition p' > 2(/3+ 1), such 7 exists. 

We now use the following modification of Azuma's inequality, introduced by Chalker et 
al. ([4], Lemma 1). For any martingale difference sequence di, i>l, and for all sequences 
Wi, i> 1, of positive numbers, we have, for all t > 0, that 



P 



>t 



< 2exp 



l + 2^ 1 sup||d J || 00 ) ^2P[\dt\>Wi]. 



1=1 



Letting w; := n 7 , t := en, using (4.12) and Markov's inequality and noting that 
sup^ Halloo < Cn 13 by the first part of (2.9), we obtain, for any e > 0, that 



P 



i=l 



> £71 



< 2exp 



+ + 



which is summable in n by the choice of 7 (since we assume (3 > 1). Hence, by the 
Borel-Cantelli lemma, we have almost sure convergence for (2.6). 
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To prove (2.7) with almost sure convergence under assumption (2.10), define 
Hx:\JZi[( Rd x -M)"]^Kby 

n 

F A (x 1 ,... ! x„) :=53|(/, 6(n) (x,; {xi, . . . , x„}) - £J (n) (x, ; {x x , . . . , x„})) | 

i=l 

and then follow the same argument as given above, with H\ replaced by H\. 
Next, we show that (2.11) implies (2.9) and (2.10) for any / E B(S1). Since 

n-l 

Aa,„ (/) = (/, 6 (X„ ; X n )) + (/» £a (X, ;X n )- £a (X, ; X n _ x )) 

i=l 

and since, for any signed measure /i on SI with total variation we have (/, /z) < ||/||oo x 
it follows by the triangle inequality and definition (2.8) that A A .„(/) < H/HooA^n 
and hence (2.11) implies (2.9). 

Finally, we show that (2.11) implies (2.10). By definition, 

A' Ai „(/) = G Ai „(/)-G A , n _ 1 (/) 

= \(f,Zx(X n ;X n )-ex(X n ;X n ))\ 

n-l 

+ '%2(\{f,&{X i ;X n )-&(X i ;X a ))\ 

i=l 

-\{f,U^Xn-l)-ex{^Xn-l))\). (4.13) 

For any real 01,02,61,62, we have |(|ai —61 — \a-2 — i>2 1 ) | _! \ a i — ^2 + |6i — 62 1, by the 
triangle inequality, and using this, we can deduce from (4.13) that 

n-l 

|A' Ai „(/)|< 211/11^6 (X„;A , „ j n)+4||/|| 00 ^|^(X i ;^)|(Q). 
By definition (2.8), this is at most 4||/|| 00 A Ai „(/), so (2.11) implies (2.10). □ 

5. Applications of the general theory 
5.1. Voronoi estimation of a set 

The first example illustrating our general result is concerned with coverage of a set by 
Voronoi cells. Let SI := (0, l) d . For finite X C R d and x<EX, let V"(x; X) denote the closed 
Voronoi cell with nucleus x for the Voronoi tessellation induced by X, that is, the set of 
y G R d lying at least as close to x (in the Euclidean sense) as to any other point of X. 
Let V(x; X) be the intersection of V(x; X) with f2. Let k be a density function on SI, let 
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Xi, X2, ... be independent random d- vectors taking values in f2 with common probability 
density k and let X n = {Xi, . . . , X„} (in this section, boldface vectors represent unmarked 
points in R d ). 

Let A be an arbitrary Borel subset of ft. Let A n be the estimator of the (un- 
known) set A based on data from sensors at X n using Voronoi cells, that is, let 
A n '■= [J x ex nA ^( x i %n)- With a view to potential applications in nonparamctric statis- 
tics and image analysis, Khmaladze and Toronjadze [11] ask whether A n is a consistent 
estimator for A. More precisely, with | • | denoting Lcbesgue measure and A denoting 
symmetric difference of sets, they ask whether we have almost sure convergence 

|A n |— >A as n — > 00, (5-1) 
\AAA n \^0 asrwoo. (5.2) 

They answer these questions affirmatively only for the case d = 1 and comment that for 
general d, (5.2) is not hard to prove when A has Lcbesgue null boundary. Using our 
general results, we can answer these questions affirmatively without any assumptions on 
the boundary of A. 

Theorem 5.1. Suppose inf{K(a;) :x £ ft} > 0. (5.1) and (5.2) then hold almost surely. 

Note that (5.2) implies (5.1). We keep these results separate for presentational pur- 
poses. Actually, the question posed in [11] refers to the almost sure limits analogous 
to (5.1) and (5.2) for A^ n , where N n is Poisson with parameter n independent of 
(Xi,X 2 , . • .), but this clearly follows from our result since N n — > 00 almost surely. 

We work toward proving Theorem 5.1. Assume henceforth in this section that k is 
bounded away from zero on ft and set ft A = ft for all A. For finite X C R d and x £ X, 
let £(x; X,-) be the restriction of Lcbesgue measure to V(x;X). Thus, £ is translation 
invariant and points do not carry marks; also, £ has the homogeneity property of order 
d, which says that t;(ax;aX,aA) = a d £(x; X,A) for any a > 0. Combining this with the 
consequence (2.2) of translation invariance, we have, for all y:,X,A,X with x£ fi, that 

£ A (x; X, A) = A£(x; X, A) = X\V (x; X)C\A\. (5.3) 



Lemma 5.1. There is a constant C such that, for t > 1, 

supP[£ n (X;Af n _!,ft) >t] < Cexp(-i/C). (5.4) 

n>l 

Proof. Let d, 1 < i < I, be a finite collection of infinite open cones in R d with angular 
radius 7t/12 and apex at 0, with union M. d . For x G and 1 < i < I, let Cj(x) be the 
translate of Ci with apex at x. Let C, + (x) be the open cone concentric to Ci(x) with apex 
x and angular radius 7t/6. Let i?i jra (x) denote the distance from x to the nearest point 
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in X n r\Cf (x)nB diam ( Ci ( x )nf!)(x); if no such point exists, set P liM (x) := diam(C 4 (x) n ft). 
In other words, set 

Ri,„(x) := min(min{|Y — x| : Y € X n n Cf(x)}, diam(Ci(x) n fi)), 

with the convention that min(0) := +00. By elementary geometry, if Y £ X n nC 4 + (x), 
then V(x.;X n ) nCj^(x) C P| Y _ x |(x). Hence, V(x; #„) n C;(x) C S fii „(x). Therefore, re- 
calling that denotes the volume of the unit ball in R d , we have, by (5.3), for any 
x £ fi, that 

a(x;A , „_ 1 ,f2) = A|V(x;A' n _ 1 )|<u; d A max i?„^i(x) d . (5.5) 

1<2<7 

Let 7? := (1/2) sin(7t/12). Then, P[P ? ;,„(x) > s] = 0, unless there exists y G C;(x) n £1 with 
|y — x| = s. But, in this case, B vs {^^-) C C+(x) so that Pj jn (x) < s, unless B T , s (^ i ) 
contains no point of X n . Moreover, since G f2 and k is bounded away from zero 
on 0, there is a constant <5, independent of x, such that k(B vs (^^-)) > Ss d . Hence, for 
1 < i < I and all u > 0, 

P[Ri, n -x(x) d > u] < (1 - (ki)"- 1 < exp(-<5(n - 1». (5.6) 

By (5.5) and (5.6), 

/ 

PK n pt;^_i,n) > t] < ^P[P;,n-i(X) d > */(«><*»)] 
i=l 

< Iexp(— 5(n — l)t/(wdn)) 
and this gives us the result. □ 

Proof of Theorem 5.1. Let ^(x;<Y, •) be the point mass at x of size £\(x.;X,n), as 
defined at (2.4). Then by (5.3), for Borel A C R d , we have 

A- 1 ^„(A) = A- 1 Yl £A(x;* n ,Q)= J] |V(x;^)[ = |A.|. 

Let Cj, 1 < i < I, be as in the proof of Lemma 5.1. Then, Tt\ fl C; ^ almost surely for 
each i < 7; set Pi (A) := inf {|x| : x £ HxCiCi} and P(A) := 2max 1 < i <j P.; (A). The Voronoi 
cell around is unaffected by changes to TL\ outside P.r(a) and hence, with this choice 
of £, we have P(0; TL\) < 00 almost surely for all A > 0. Also, £^(7i,\,R d ) is the Lebcsgue 
measure of the cell centred at in the Voronoi tessellation of H\ U {0}. Since y lies in 
this cell if and only if Pi y i(y) contains no point of 7Ya, by Fubini's theorem, we have 

E[^(HA,R d )]= / P[HAnP| y |(y) = 0]dy 
= f cxp(-A^| y | d )dy = l/A. 
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Set X(n) = n for all n. By Lemma 5.1, the measure £ satisfies the moments condition 
(2.5). Also, £ satisfies assumption A2. Hence, setting / to be the indicator function 1a, 
we can apply Theorem 2.1 to deduce that 

n-^JA)^ f (l/«(x))«(x)dx = |A| J 



that is, we have (5.1) with L 2 convergence. 
With / = 1a, using (5.3), we have 

A" 1 (/, £ A (x; X) - S(x; X)) = A" 1 (&(x; X, A) - /(x)&(x; X , O)) 

= £(x;A\A)-/(x)£(x;*,Q) 
= (1 - /(x))£(x; X, A) - /(x)£(x; A 1 , fl \ A) 

and hence 

^A- 1 |(/,a(x;^)-a(x;A'))|= ]T £(x;A', J 4)+ £ £(x;*,fi\A) 

xGX x£X\A xEXnA 

= ]T \V(x;X)nA\+ J2 \VteX)\A\ 

xex\A xe^nA 



AA \J V(x;X) 



xGXnA 



= \AAA n \ 



Therefore, by applying the conclusion (2.7) of the general result in this particular case 
with X(n) = n, we obtain (5.2) with L 1 convergence. 

For the almost sure convergence, we demonstrate the condition (2.11) for the present 
choice of £. Observe that for 1 < i < n — 1, the signed measure £(X^; X n _i) — £(X^; X n ) 
is, in fact, a non-negative measure, namely the Lcbesgue measure on X^(X,;; X n —\) n 
V(X. n ; X n ), since this is the region (if any) removed from the Voronoi cell around Xj due 
to the addition of an extra point at X„. Thus, by (5.3), £a(X^; X n ^i) — £ A (Xj; X n ) is A 
times the same measure. Hence £ A (Xi; X n ) — £ A (Xi; X n -i) has no positive part and its 
total variation on f2 is 

|6(X i; X n ) - £ A (X,; = A|V(X i; Af„_ x ) n V(X„; 

Hence, by (2.8) and (5.3), 

A„ jrl = £n(X„; , fi) + |£„(Xj-; A^„) — £„(Xjj A'„_i)|(f2) 

i=i 

= 2n|y(X„;-Y„)| 
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and the third moments of this are bounded uniformly in n by Lemma 5.1. Thus, (2.11) 
holds here with f3 = 1 and p' = 3, so both (2.6) and (2.7) hold with almost sure conver- 
gence. □ 



5.2. Nonparametric regression: the Gamma test 

Suppose that Xi,i > 1, are independent random d- vectors with common density k. In 
a nonparametric regression model, consider real- valued random variables Yi, 1 < i < n, 
related to random d- vectors Xi, 1 < i < n, by the relation 

Fi=/i(Xi) + ei , l<i<n, (5.7) 

where /i G C 2 (M d ,M) and (e,,i > 1) are independent and identically distributed with 
mean zero and common variance er 2 , independent of {Xi)i>\. Both the function h and 
the variance a 2 are unknown and k may also be unknown. Often, it is of primary interest 
to estimate h, but here, we are concerned with estimating a 2 . 

Given k £ N and n> k, 1 < i < n, let j (i, n, k) be the index of the fcth nearest neighbour 
of Xi in the sample {Xi, . . . , X n }, that is, the value of j such that \Xi — Xi\ < \Xj — Xj| 
for precisely k — 1 values of £ G {1, 2, . . . , n} \ {i}. The so-called Gamma statistic discussed 
by Evans and Jones [5] (see also [10]) with parameter k is an estimator j n k for a 2 given 

by 

l " 

i=l 

For large n, we expect |^(i.„,fc) — Xi \ to be small, so we approximate to h(Yju nt k\) — h(Yi) 
by the first-order Taylor approximation Vh(Xi) ■ {Xj^ i n ^ — Xi). Under the proposed 
model, this approximation gives us 

1 " 

7n,fc - ^ 2 « o~ ^t( V ' ( X j(i,n,fc) - + e jU »,fc) - e,) 2 - 2a 2 ]. (5.9) 

i=l 

The mean of the last expression is ^K[(Vh(Xi) ■ {Xjt l n ^\ — Xi)) 2 ]. Following equation 
(2.9) of [5], define 

, _ E[(Vh(X 1 )-(X j{1 ^ k) -X 1 )) 2 } 

An - k '- 2El\X ]{hn , k) -X 1 \ 2 ] ■ (b - W > 

Evans and Jones [5] set 6k, n : = n^ 1 S"=i \Xj(i, n ,k) ~ Xi\ 2 and propose to estimate a 2 
by the intercept on the y-axis of a regression of y = against x = plotted for 
1 < k < fco, with, for example, fco = 20. They argue heuristically (see the discussion leading 
up to Theorem 2.1 of [5]) that for large n, the value of A n< k should be approximately 
independent of k and give the slope of the regression line. The following result proves the 
first of these assertions as an asymptotic statement since the right-hand side of (5.11) 
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below does not depend on k. In proving this, we shall give the asymptotic behaviour of 
the expected value of the expression in (5.9). 

We assume throughout this section that SI C M. d is bounded and open with «(S1) = 1, 
that |V/i| is bounded on f2 and that K,(B r (x))/r d is bounded away from zero, uniformly 
over x € Q.,0 < r < 1. The last condition holds, for example, if f2 is a finite union of 
convex sets and the density function n is bounded away from zero on f2. 

Theorem 5.2. As n — > oo, 

J Q K(xY d - 2 ^ d \Vh(x)\ 2 dx 
n ' k ^ 2df QK (x)( d - 2 )/ d dx " (5 - U) 

As n gets larger, one expects the Gamma estimator of a 2 to become more sensitive, 
thus one expects to be able to estimate smaller values of a 2 . In the next result, we allow 
the common variance of the Yi — h(Xi) to get smaller as n increases. More precisely, we 
modify (5.7) to 

Y itn = h(X i )+n- 1/d e i , l<i<n. (5.12) 

We consider an estimator for a 2 in this model using a linear regression of just two points 
arising from k = 1 and k = 2 (we could similarly consider any other two choices of k). Let 
the random variable denote the distance from the origin to its feth nearest neighbour 
in the point set H±. It is known (see [22], equation (19)) that 

n P t}=u;- 2/d T(k+(2/d))/T(k). 



Theorem 5.3. Let ^n.k be given by (5.8), with Yi — Yi^ n given by (5.12). Then, 



This result shows that the left-hand side of (5.13) (which is the intercept in linear 
regression of n 2 ^ d j n & against Epfc, based on just two values of k) is a consistent estimator 
of a 2 . 

In proving Theorems 5.2 and 5.3, we use the following notation. For locally finite 
X C M d and for x £ K d , let N\(x) be the fcth nearest neighbour of x in the set X \ {x}, 
making the choice according to an arbitrary rule in the event of ties and taking N% (x) = x 
if X \{x} has fewer than k elements. 

Lemma 5.2. Suppose b £ R d and A > 0. Then, 

E[|b ■ A^ A (0)| 2 ] = d-iA-^lbpE^]. (5.14) 
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Proof. Let = (0±, . . . ,0d) be uniformly distributed over the unit sphere in M. d , inde- 
pendent of pk- Since Yli=i @i = l®| 2 = 1> taking expectations, we have E[# 2 ] = 1/e? for 
each i. Given b, we have, for some unit vector e, that b = |b|e and hence 

E[(b • 6) 2 ] = |b| 2 E[|e • 6| 2 ] = d-^bl 2 , (5.15) 

where the last equality follows because, by rotational symmetry, the distribution of |e • 0| 2 
is the same for all unit vectors e and its expectation is 1/d whenever e is one of the unit 
coordinate vectors. 

By the distributional rotational symmetry of the homogeneous Poisson process and 
the fact that 7i\ has the same distribution as A _1 / d 7Yi for any A > (by the Mapping 

Theorem [12]), we have (0) = A^^p^O. Taking expectations and using (5.15), we 
have (5.14). □ 

Proof of Theorem 5.2. Let £(x; X) be a point mass at x of size (Vh(x) ■ (N x — x)) 2 
and let £l\ = Q for all A. Then, for x £ Q, £\(x;<-t) is a point mass at x of size 
\ 2 / d (yh(x) ■ (N x (x) — x)) 2 . By assumption, |V/i| is bounded on f2, so there is a constant 
C such that, for x G ft, 

\\7h(x).(N k x (x)-x)\ 2 <C\N x (x)-x\ 2 

and hence, for x G O, 

P[£ n {x\ A" n _i,0) >t]< P[\N k Xn i (x) -x\ 2 > t/{Cn 2 ' d )]. 

By assumption, K,(B r {x))/r d is bounded away from zero on < r < diam(r2), so there 
are constants C, C" such that for x G ft, n > 2k and (t/ '(Cn 2/d )) 1/2 < diam(ft), 

< cxp(-t d / 2 /0 

and this bound also holds for (t/(C*n 2 / d )) 1 / 2 > diam(ft) since, in that case, the prob- 
ability is zero. It follows that (2.5) holds here for any p. Therefore, we may apply 
Theorem 2.1, here taking / = 1 and fl\ = il for all A, followed by Lemma 5.2, to ob- 
tain 

n 2 / d E[(V/i(Xi) • (X i( i,„, fc )) -Xr) 2 ] = E[Z n (Xi;X n ,n)} =n- 1 E[{f, u n<n )} 

-> / E[(Vh(x) ■ Nfr (0)) 2 }K(x)dx 

Jn K<x) 

= <T 1 E[pjJ] f Kixy-W^lVhix^dx. (5.16) 
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As for the denominator of the expression (5.10) for A n< k, we have (see [6, 18, 22]) 
that 



n 2 



/ d E[|X i(i> „, fe) -Xrf] = rt^Cn^lX^fc) -X,|) 2 

3=1 

and combining this with the limiting expression (5.16) for the numerator, we obtain 
(5.11). □ 

Proof of Theorem 5.3. We give just a sketch. We must now consider marked points. 
The mark space is M. = R and the real-valued mark T(x) attached to a point x e R d is 
assumed to have the common distribution of e±, e 2l . . . (with mean zero and variance a 2 ). 
Given k and given x = (x,T(x)) with x G X and X a finite subset of M. d x Ad, we set 

£(x; X) = (Vh(x) ■ (N*(x) -x)+ T{N k x {x)) - T(x)f. 

This can be shown to satisfy the conditions of Theorem 2.1. We set e, = T(Xi) and 
Xi = {Xi, ei) and X n = {Xi, . . . , X n }. By first-order Taylor approximation, 

(2/d) — 1 n 

n 2/ % hk « ^— ^(V/i(X) • (X^, n , fe) - X) + n-^ d (e Mn:k) - e,)f 



1 " 

i=l 

1 " 1 

= 7T~ / j in{Xi,X n ) = — (/, V n n ), 

»=i 

where we put / = 1. By Theorem 2.1, this converges in L 1 to the limit 
\ f E[(Vh(x) ■ N HkM + T(N HkM ) - T(0)) 2 }k(x) dx 

|V/i(aO| a sO»0 1 ~ (2/ ' ,) dx, 



G 2 , E[^] 



2(7 



where we have used Lemma 5.2 and the fact that the marks T have mean zero, variance 

«(x)- 



a 2 and are independent of each other and of the point process Tt K r 



It then follows that we have the L 1 convergence 

n 2 / d ( 7n , 2 Ep 2 - j n>1 Ep 2 2 ) -> {Ep 2 - Ep 2 )a 2 
and this implies (5.13). □ 
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